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@ The isolation of the yeast o-f actor genes ts described. The 
promoter and signal peptide portions are isolated and joined 
to DNA coding for proteins heterologous to yeast in a ptasmid 
which is used to transform yeast cells. The yeast expresses 
the heterologous DNA and processes and secretes the 
heterologous protein. 
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Docket No. 100/175 



PROCESS FOR EXPRESSING HETEROLOGOUS PROTEIN IN 
YEAST, EXPRESSION VEHICLES AND YEAST ORGANISMS 

THEREFOR 



Hiis c^jplicaticxi is irelated to US applications Serial No. 
06/438,236 (EP 88632), and Serial No. 06/488,337 filed 25 ;^ril 1983 
(European cqpplication filed an even date), the disclosures of 
vjhich are hereby incorporated by reference. 



Field of the Invention 

This invention is directed generally to recombinant ONA 
technology utilizing yeast host systems and expression vehicles that 
produce, process and secrete heterologous protein as discrete 
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product unaccompanied by interfering amounts of unwanted presequence 
or other artifact of expression. 

Proteins that are secreted through the cell roerabrane of the 
parent cell are ordinarily produced in the cell as a "pre'-protein. 
In that fonn, the protein is fused to an additional polypeptide 
sequence which presumably assists its secretion and localization- 
This additional protein, referred to as a "signal" polypeptide,- is 
believed to be clipped from tte secreted "nature" protein during the 
secretion process. Although the signal peptides of pre-proteins 
sharfe some similarities, their primary structures differ 
considerably. The signal peptides even for a given organism exhibit 
this variation. For example, the signal for human growth honnone is 
substantially different froo the signal for human insulin. This 
suggests that each protein has evolved with a signal sequence which 
is particularly well suited for translocation of that pari:icular 
protein through a cell membrane. 

This invention is based upon the discovery that a substantially 
mature protein is produced and often secreted fay yeast wten the DNA 
coding for the heterologous protein is operably attached to the DKA 
sequence of the pranoter and/or signal peptide encoding portions of 
the yeast a-f actor gene. (It will be apparent from the disclosure 
herein that yeast harbors at least two alpha factor genes. The use 
of "the alpha factor gene" is intended to include all such 
functional genes.) Thus, in a primary aq)ect, this inventicn is 
directed to the means and methods of obtaining useful quantities of 
heterologous protein from the medium of a yeast culture containing 
viable cells harboring expression vehicles containing DNA encoding 
the desired protein, wherein the DNA coding for this heterologous' 
protein is operably connected to a DMA sequence conq)rising the 
promoter and/or signal portion of the yeast a-factor gene. Of 
enormous advantage is the enablement, by this invention, of 
obtaining useful, discrete protein product in the cell culture 
medium by expression of heterologous DNA in an easily modified 
plasmid. 
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The alpha factor of yeast contains a "pre-pro" sequence which is 
ordinarily removed from the a- factor upon the completed act of 
secretion. Operationally, therefore, the pre-pro sequence functions 
as a signal sequence in the process of secretion into the medium as 
will be further explained below* It is clear from the results 
obtained herein that the peptide which comprises the thus-defined 
signal sequence of alpha factor fused to a heterologous protein is 
successfully processed by the yeast organism so as to result in. the 
secretion of the mature heterlogous protein into the surrounding 
medium. Therefore, the advantages obtained by use of this "pre-pro" 
signal are realized whether or not the expression of the 
signal /heterologous protein gene sequence is under the control of 
the alpha factor promoter or under the control of other promoters 
which are functional in yeast. Similarly, the results obtained 
demonstrate that the alpha factor promoter is effective in 
expressing the heterologous gene, and that such expression could be 
obtained without the intermediate insertion of the signal sequence 
into 'the expression vehicle. Accordingly, this invention is 
directed to the use of alpha factor promoter qua promoter in yeast 
systems for the expression of heterologous peptides and to the use 
of the alpha factor signal qua signal as a means for effecting 
processing and secretion of heterologous proteins produced as a 
result of expression in yeast. 

The publications and other materials referred to herein to 
illuminate the background of the invention, and in particular cases, 
to provide additional detail respecting its practice, are 
incorporated herein by reference, and for convenience, are 
numerically referenced and grouped in the appended bibliography. 

Background of the Invention 

Yeast organisms naturally transport a small number of certain 
homologous proteins to, and sometimes through, the plasma membrane 
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as an essential contribution to cell surface growth and cell 
metabolism. As the cell buds as an incident of reproduction 
preparatory to formation of a daughter cell, additional proteins are 
required for formation of cell wall and plasma membrane as well as 

5 for metabolism* Some of these proteins must find their way to the 

site of function; hence, a secretory pathway is believed to exist 
(1). Certain homologous proteins involved in the above processes 
are formed by translation by ribosomes attached to the endoplasmic 
reticulum. Homologous proteins are those nornially produced ty the 

10 yeast species and required for its viability. Once formed, th^y 
migrate by transfer to Golgi apparatus, thence within vesicles to 
plasma membranes where some associate, or to some extent, penetrate 
into the space between the plasma membrane and the cell wall. A 
small number of homologous proteins seems to be exported completely 

15 through the cell wall, such as a-factor and killer toxin (2,3). 

Again, the bud region of the cell seems to be the site of 
attraction for the vesicles and by their fusion to the inner surface 
. of the bud they contribute to the overall grovrth of the plasma 

20 membrane, and presumably^ tte cell wall (4,5,6). It is 

controversial still whether glycosylation of the protein may assist, 
or is implicated. In the so-called secretory process. Further, by 
definition "secreted" proteins are believed to have a signal 
prepeptide, postulated to be associated with the transport or 

25 incorporation process at the membrane surface. However, the precise 

mechamsin involved in the overall secretory process is not fully 
understood. 

It was contemplated that recombinant DNA technology could 
30 provide valuable assistance in answering the open questions about 

the secretory process in yeast organisms and, given its proven 
applicability in enabling such, and other, organisms to produce 
copious quantities of heterologous polypeptide products endogenously 
(See, e.g., 7 to 17), in achieving appropriate manipulation of the 
35 yeast host so as to direct the secretion of heterologous protein in 
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discrete, mature fonn. This has, in fact, been achieved and is the 
subject of US 438,236 (EP 88632), 

supra. In that application is described the discovery that a 
heterologous protein, initially expressed as a pre-protein with its 
native signal or hybrid thereof, can be processed and secreted by 
yeast as a mature protein. 

Summary of the Invention 

This invention is based on the discovery that yeast organisms 
canT)e caused to produce, process and secrete protein that is 
nonnally heterologous to the yeast organism and not required for its 
viability, such that the protein can be obtained from the medium 
supporting the viable and reproducing yeast cells and in discrete 
form substantially unaccompanied by unwanted peptide presequence or 
other artifact of expression. For this purpose, a DNA sequence 
encoding the desired, heterologous protein is linked to the DNA 
sequence encoding the non-native {to the protein) signal sequence of 
yeast a-factor. Suitable yeast cells are transfonned with 
expression vehicles harboring such DNA encoding a heterologous 
protein operably connected to the such DNA coding for the a-factor 
signal (pre-pro) peptide and a promoter. Upon expression of the 
sequence encoding the heterologous protein together with that 
encoding a-factor signal peptide, the expression product is 
processed and the mature heterologous protein is exported into the. , 
medium of the cell culture, from which it can be removed with 
relative ease, without need to disnpt the viable yeast cells. It 
is thus recovered in otherwise substantially mature form for use, 
without the need to remove unwanted presequence or certain other 
artifacts of expression (e.g., the methionine attached to the 
otherwise first N-terminus amino acid which is an expressional 
consequence of the AUG translational start signal codon). Thus, the 
medium can be obtained in a form substantially free of viable or 
disrupted (i.e., lysed or otherwise broken) cells and, since it 
contains the desired product, is susceptible to more easily employed 
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purification techniques. Such product, after purification, is fit 
for use as intended. For example, human leukocyte interferon 
product finds use as a hirnian antiviral and/or antitumor agent (See, 
generally, 7 to 17). 

In summary, the present invention comprises the use of yeast 
alpha factor signal sequences and/or promoter to produce a protein 
normally heterologous to a yeast organism and not required for -its 
viability, in discrete fona unaccompanied by any substantial peptide 
presequence or other artifact of expression, as a product of yeast 
expression, processing and secretion. Further, this invention 
provides yeast cultures capable of producing such protein and 
resultant yeast culture media containing such protein as product. 
More specifically, the invention is directed to a process for 
producing heterologous proteins in yeast, and the expression 
vehicles and organiaas employed in this process, wherein the alpha 
factor promoter is used to effect the expression of the foreign 
gene*. Further, the invention is directed to the use of th^ signal 
(pre-pro> sequence for alpha factor to effect the processing and 
secretion of an expressed foreign protein, to a recanbinant 
expression vehicle effectively harboring the alpha factcor DMA 
sequences and to the cells transformed vrith such vehicles. 

By the terra "heterologous protein" as used herein is meant 
protein that is not normally produced by or required for viability, 
of a yeast organism. This term contemplates the fuixrtional 
insertion of DHA encoding such protein, via recombinant DNA 
technology, into an expression vehicle, in turn used to transform a 
yeast organism host. Functional insertion of DHA denotes the 
insertion of DNA encoding the heterologous protein into an 
expression vector under control of the a-f actor promoter and/or 
connected to the DHA sequence coding for the a-f actor signal to 
obtain a hybrid preprotein, i.e., one which comprises the a-factor 
signal peptide fused to the heterologous protein. Examples of such 
heterologous protein are hormones, e.g., human grovrth hormone. 
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bovine growth honnone, etc,; lymphokines; enzymes; interferons, 
e.g., human fibroblast, human immune and human and hybrid leukocyte 
interferons, bovine interferons etc.; viral antigens or immunogens, 
e.g., foot and mouth disease antigens, influenza antigenic protein, 
hepatitis core and surface antigens, etc,; factors incidental to 
grov/th, e.g* human insulin-like growth factor {IGF-1 and I6F-2), 
epidermal growth factor (EGF) and nerve growth factor (NGF) and 
various other polypeptides, e,g*, rennin, human serum albumin, -human 
insulin, various glycoproteins, etc. 

'"Secretion" as used herein means exportation of product through 
the plasma membrane and at least into or through the cell wall of 
the yeast organism into the medium supporting the cell culture. In 
this connection, it will be understood that in some instances, 
"secreted" product associates in some manner with the cell wall, 
perhaps necessitating a different purification procedure or a 
modification of the structure and function of the yeast host. 

"Processing" means the cellular cleavage of the a-f actor signal 
peptide from the mature protein so as to produce the heterologous 
protein unaccompanied by any substantial portion of the signal 
sequence or by extraneous peptide in — so-called discrete—mature 
form. By "extraneous" peptide is included peptide artifacts of 
expression such as methionine. Processing admits of cleavage of the 
signal polypeptide at a locus inconsequentially removed from the • 
precise point of signal peptide union with mature protein. 

Brief Description of the Drawings 

Fig. 1 illustrates the structure of pools of synthetic 
oligonucleotides used as Iiybridization probes to isolate the gene 
for a- factor. 

Fig. 2 illustrates the results of electrophoresis of DMA 
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fragments obtained using the probes of Fig. 1. 

Figs. 3 and 4 are the nucleotide sequences of a-f actor genes. 

. Fig, 5 illustrates the schene for joining the gene for human 
interferon D with the gene for the a-f actor pronoter and signal 
sequence. 

Fig. 6 illustrates the scherae for construction of a yeast 
^expression plasmid for expression of human interferon D (IFN-oj^). 

Fig. 7 depicts the protein and DMA sequence at the junction of 
the a-factor signal sequence and the modified IFN-aj^ gene. 

Fig. 8 shows the levels of IFH-a, in the nediura and cell 
5 1 

extracts of a culture of a yeast transforaant expressing IFH-a^^. 

Fig. 9 illustrates the scheme for construction of a 
yeast/E- coli shuttle vector for expression of heterologous genes 
Qusing the a-factor procnoter and signal polypeptide gene sequences. 

Fig. 10 illustrates the assembly of a yeast/£. coli plasmid for 
expression of tissue plasminogen activator, 

,g Fig. 11 depicts the construction used to effect the ^product! on 
of mature heterologbus' prbXeihs* thuman interferon illustrateffT as 
a product of expression with partial alpha factor signal sequence, 
processing of the alpha factor coraponnient and secretion of the 
mature protein into the supporting aediura. 

Fig. 12 illustrates the degree of consensus between the MFal and 
l^aZ polypeptides of figures 3 and 4. 



35 
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Detailed Description of a Preferred Embodiment 

The yeast Saccharomyces cerevisiae secretes only a limited 
number of proteins into the culture medium. One of the proteins 
that is found in the medium is a-pheromone or a-factor (2). Duntze 
and coworkers (18, 19) first determined that the a-factor is a 
family of four oligopeptides of 12-13 amino acid residues having the 
basic sequence 

H2K-(Trp)-His-Trp-Leu-Gln-Leu-Lys-Pro-Gly-Gln-Pro-Ket (or KetSO)-Tyr-COOH. i 

Figure 3 shows the location of the four peptides in the 
unprocessed product of one of the a-factor genes, MFo-1. Only the 
"boxed" segments are secreted into the medium, the remaining 
sequences are not. It is not clear how much of the remaining 
sequence is "true" signal (pre) sequence, which is at least 
partially processed to effect secretion, and how much is "pro" 
sequence in the sense of a traditional precursor protein (e.g., 
prorennin, proinsulin.) 

Similarly, only the "boxed" portions in product of the MFa2 
gene shown in Figure 4 are secreted, and the nature of the remaining 
sequences can be described analogously to that of those in MFal. \ 

During the reduction of this invention to practice, another | 
group (44) succeeded in isolating and sequencing one of the genes | 
for a-factor (KFol) by methods different from those disclosed | 
herein. As described below, two a-factor genes were isolated by us | 
and expression vectors in which the DMA sequence for the promoter 
and signal peptide of a-factor was inserted in tandem with the DMA 
sequence coding for heterologous protein were constructed from one 
of them. 

A. Bacterial and Yeast Strains 

E. coli K-12 strain 294 (endA thi"hsr"hsm'^)(ATTC 
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31446) (22) was used for bacterial transforroations. Yeast strain 
20B-12 (a, trpl pep4 ) deposited without restriction in the American 
Type Culture Collection, ATCC Ho. 20626, on March 5, 1982 was used 
as yeast host- 

B. Growth Media 

The routine yeast growth medium contained 1 percent 
Bacto-yeast extract, 2 percent Bacto-peptone and 2 percent 
dextrose. Yeast minimal medium contained 0.67 percent Bacto-yeast 
nitrogen base without amino acids, 2 percent dextrose and 3 percent 
gar. The minimal medium supplemented with IK sorbitol was used for 
yeast transformations. Bacterial growth aedium was LB (25) which 
was supplemented with 20 pg/ml ampicillin when used for 
transformation. S-agar plates used for colony screening contained 
per liter: 32g tryptone, 5g MaCl, 15g Difco agar and 0.2g NaOH to 
which ampicillin or chloramphenicol was added as indicated. 

C. Transf onnati ons 

coli 294 was transformed using a published procedure 
(23). Yeast were transforrod essentially as described (21, 24). 

D. Enzymes and DMA Preparations 

Restriction enzymes were purchased from Hew England Biolabs 
and Bethesda Research Laboratories and were used according to 
manufacturer's recommendations. T4 DHA ligase was from Hew England 
Biolabs and was used in 20h« Tris-HCl (pH 7.5), IftiM MgClg, ICmM 
dithiothreitol, ImM ATP at 14''C. Calf alkaline phosphatase was 
purchased from Boehringer Mannheim and was used in lOOmM NaCl, 50mM 
Tris-HCl (pH 7.4), lOmM HgSO-, ln*l 2-mercapto-ethanol at 37'C. 
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Plasmid DMAs were prepared by the cleared lysate method (29) 
and were purified by Bio-Rad Agarose A-50 column chromatography. 
Small amounts of plasmid DMAs from individual coli transfomants 
were prepared by a quick-screening procedure (20). DNA restriction 
fragments were isolated by electroelution from a 1 percent agarose 
gel followed by phenol /chloroform extraction and ethanol 
precipitation. Oligo-deoxynucleotide probes were prepared by .the 
phosphotri ester method (41). 

E. Design of the Hybridization' Probe 

The 15-mer oligonucleotide probes for the a-f actor gene were 
designed on the basis of the amino acid sequence of the pheromone 
(19) and yeast codon usage frequencies. The rationale is outlined 
in Fig. 1 where the last 5 amino acids of the a-factor and all the 
possible codons and their usage frequencies are given. (The codon 
usage is the total of 2 different glyceraldehyde-3-phosphate 
.dehydrogenase clones (30, 31) and of alcohol dehydrogenase 1.) The 
codon usage for these and other genes has recently been summarized 
(45), As can be seen from Fig. 1, virtually all possible sequences 
coding for the 5 amino acids are included in the oligonucleotide 



sequence 5'-GGiCAACCATGTAC. Accordingly, two pools consisting of 



tv/o oligonucleotides each, and complementary to the above sequence, 
"Were ^yrvthesi zed. No dther contiguous 5 amino acids in the 
pheromone could be covered with such a limited set of 
oligonucleotides . 

F. Screening of Recombinant Plasmids 

A genomic library, made by insertion of partially 
SauSA-diges ted yeast DNA into the BamHI site of YRp7 (32), was 
screened for presence of a-factor gene clones. E. coli 
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transformants were grown on nitrocellulose filter paper (Schleicher 

and Schuell, BA85) placed on S-agar plates containing 5 g/ml 

ampicillin. After 6 hours at 37**C, filters were transferred to 

S-agar plates containing 150 g/ml chloramphenicol. After 15 hours 

of amplification colonies were tested for hybridization using a 

32 

modified in situ colony screening procedure {38). P-labeled (40) 
synthetic oligonucleotides described above were used as hybridization 
probes. Filters were hybridized overnight at A2X in lOctfi Tris (pH 7,5), 
6mM EDTA, O.lmH ATP, ImH sodiun pyrophosphate, 0.8M MaCl , IX Denhardt's 
solution, 0,5 percent KP-40, and 0.1 mg/ml £. coli tRNA. Filters were 
washed 3 times for 20 rain, in 6XSSC at 30**. Dried filters were exposed to 
Kodak XR-2 X-ray film with Dupont Lightning-Plus intensifying screen at 
-80'. 

G. Identification of Recombinant Plasmids Containing the a-factor Gene 

Approximately 4500 bacterial colonies containing recombinant 

plasmids were tested for in situ hybridization (38) with 
32p 

-end-labeled oligonucleotide pool I (Fig. 1). Twenty-four 
plasmids hybridized to varying degrees. Small amounts of plasraid 
DMAs were prepared from these 24 colonies by the method of Bimboira 
and Doly (20) and tested for hybridization with the same probes 
after spotting the DNA samples on a nitrocellulose filter. Two of 
the 24 plasmids, designated as p51 and p52 respectively, lybridized 
strongly and were chosen for further study. The p51 and p52 
plasmids also hybridized with the oligonucleotide pool 
II. 

H. Subcloning of the Hybridizing Sequences 

To characterize the inserts that hybridized with the 
synthetic probes, plasraid DNA prepared from the p51 and p52 clones „ 
was subjected to restriction enzyme analysis with EcoR I, Sai l, 
Hindi! I, BamH I, and Pstl . As seen in Fig. 2A, the 2 recombinant 
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plasmids are quite dissimilar. Only EcoRI an PstI digestions of the 
tv;o plasmids yielded one common fragment each. In both cases the 
common fragment is the TRPl insert and the 1.38 PstI piece is the 

R 

DNA between PstI sites in the TRPl and the amp genes. 

The fragments that contained sequences complementary to the 
probe were identified by the method of Southern (42). Fig. 2B shows 
that, except in one case, digestion with all 5 restriction enzymes 
yielded a fragment that specifically hybridized with the probe. No 
hybridization was seen with any of the fragments produced by 
restriction of p52 DNA with Hindi!!. 

, , .The. smallest restriction fragments that contained sequences 

complementary to the synthetic probes were the 1.7 kbp EcoR I 
fragment from p52 and the 1.8 kbp Hi nd lll fragment from p51. These 
two DNA fragments were isolated from a preparative agarose gel by 
electroelution and separately ligated to appropriately cleaved 
plasmid pBR322 (33) DNA. The ligation mixture was used to transfonn 
E. coli 294 and the plasmid DNA from the transfonnants was analyzed 
by a quick-screen procedure (20). Two transfonnants, designated p53 
and p56, containing the 1.7 kbp EcoR I and 1.8 kbp Hind ! II fragment 
inserts, respectively, were analyzed as follows: Plasmid DNA was 
prepared from p53 an p56 and digested separately v/ith BamHI, Clal , 
Pvul , Pst! , and Sail . The resulting DNA fragments were separated on 
a 1 percent agarose gel, transferred to nitrocellulose filter paper 
(42) and' tested for ho'bridization with p-labeled probes. The 
analysis of the restriction digests and corresponding hybridization 

patterns of the p53 DNA, the recombinant plasmid containing the 1.7 
kbp yeast DNA as an EcoR I fragment; showed that the yeast DNA in 
this clone contained one Sail and two PstI sites and that the 
sequence complementary to the probes was included within a 0.5 kbp 
Pstl-Sall fragment. The Hind! 1 1 fragment of yeast DNA in the clone 
p56 lacked recognition sites for these enzymes, and the linearized 
plasmid, resulting from cleavage at single recognition sites for 
these en^mes in the pBR322 vector, hybridized with the probes. 
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This plasmid was then digested with a number of additional 
restriction endonucleases and the digests were analyzed by the 
method of Southern as described above. It was found that the 
hybridizing sequences in this plasmid were contained on a 1.3 kbp 
Hindlll-SacI fragment. 

The property of growth inhibition of "a" cells by a-f actor 
was used to test whether or not the pherooone gene contained in the 
cloned 1.7 kbp EcoR I and 1.8 kbp Hind i 1 1 fragments are functional. 
If an active a-factor pherooone gene were present in a plasmid, it 
would be expected significantly more pheromone would be synthesized 
in cells containing the multi-copy plasmid than in cells containing 
only the chromosomal copy (or copies) of the gene. The enhanced 
level of the a-factor could then be detected by an increase in the 
area of nongrowth in a lawn of responsive "a" cells. The 1.7 kbp 
fragment, isolated from fcoRI -digested p53 DMA, and the 1.8 kbp 
fragment, isolated from Hindi 1 1 -digested p56 DNA, were separately 
ligated to a pBR322-based vector plasmid which contained the yeast 
selectable marker TRPl and the yeast origin of replication from the 
2ura yeast plasmid (43). Yeast strain 20B-12 was separately 
transformed with these plasmids and with a control plasmid that 
lacked DNA sequences coding for the a-factor. The transforroants 
were then compared for pheroowne production. The transfonnants 
containing Vfal or HFa2 coding sequences on plasmids produced 
significantly more a-factor than the same strain -transformed with ■ 
the control plasmid. He concluded that the 1.7 kbp EcoR I (hFal) and 
1.8 kbp Hindlll (MFo2) fragrasnts ccxitain active a-factor pnercxtam 
genes. The result with J^al is consistent with that described by 
Kurjan and Herskowitz (44), as this gene corresponds to the gene 
described by them. 

I. DMA Sequence Determination 

DNA sequence determination was as previously described 
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(45). Briefly, DMA sequences were obtained by the chain termination 
method (47) using recombinant phages M13 mp8 and nip9 (39) as the 
source for single-stranded "template" DNA and a synthetic 
oligonucleotide for priming E, coli DNA polymerase I (large 
fragment, Boehringer Mannheim) in the presence of a- P dCTP (400 
Ci/mmole, Amersham). Reactions were electrophoresed on 5 percent 
polyacrylamide/8M urea "thin" gels (47). Gels were dried onto 3MM 
paper (Whatman) and exposed to X-ray film for 2 to 12 hr- 

The nucleotide sequences of large parts of the 1.7 kbp EcoRI 
fragment and the 1-3 kbp Hind i 1 1 -Sad fragment are shown in Fig. 3 
and Fig. 4, respectively. The p53 sequence contains an open reading 
frame coding for a protein of 165 amino acid residues v/hich carries 
4 internal repeat units within its C-terminal half. Each unit 
begins with Lys-Arg and ends with the a-factor sequence. Within 
each unit the pair of basic residues is separated from the 
a-factorby several Glu (or Asp)-Ala dipeptide repeats. The 
N-teVminal half of the protein starts with a highly hydrophobic 
sequence of 22 amino acids which probably represents a signal 
sequence for secretion. The 61 amino acid residues between this 
hydrophobic sequence and the first repeat unit include 3 possible 
recognition sites for M-glycosylation (indicated by bars in figs. 
3). The organization of the pheromone gene contained in p53 clone 
is identical to the MFa gene recently described by Kurjan and 
Herskowitz (44). This gene differs from VFal at.4 positions. It. 
contains T (instead of C) residues at positions -8 and -7» and 125 
and an A (instead of C) residue at position 604. Because of : the 
difference at position 125 there is a TTA (Leu) rather than TCA 
(Ser) codon at amino acid position 42* We have designated the gene 
contained in p53 as NFal. 

A different a-factor gene, MFa2, is present in the p56 
clone. The organization of this gene (Fig 4) is similar, but not 
identical, to the MFal. The a-factor encoded by this gene is 
apparently made as a precursor protein of 120 amino acid residues 
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containing two copies of the pberomone. One of the a-pheromone 
tridecapeptides contained in the putative precursor Is identical to 
the pheromone copies encoded by the gene, whereas the second 
copy contains a Gin — >Asn and a Lys — >Arg. 

The organization of these precursors is strikingly similar 
to that of certain raaranalian precursors for neuroendocrine 
peptides. Thus, like the propiocielanocortin (48, 49), proenkephalin 
(50-52), and prodynorphin (53), the yeast precursors contain 
multiple peptide units destined for secretion. In all these 
precursors the secreted unit is contained on the C-tenninal half of 
the precursor. The N-terainal half of the molecules carry possible 
glycosylation sites. As is the case for the raamraalian 
multifunctional precursors, glycosylation may be involved in the 
correct processing of the a-factor precursor. However, the actual 
processing steps for the yeast precursor seem to be unpredictably 
different from those of mamroallan precursor proteins. Whereas the 
pairs of basic residues (Lys-Arg) providing sites for release 
directly flank the secreted peptide in the raanroalian precursors, 
cleavage at these sites in the a-factor precursor would release the 
pheromone units with several additional amino acids at the 
H-teminus (see figs. 3 and 4). These H-temiinus extensions would 
consist of repeating -X-Ala- sequences in the precursors encoded by 
both MFal and VFa2 genes. Recent experiments (54, 55) indicate that 
the last step in the processing of the a-factor precursors is the- 
reraoval. of these sequences by dipeptidyl amino peptidases* The bee 
venom melittin (56) and the frog skin caerulein (57) precursors are 
apparently processed by similar mechanisms. 

J. Construction of a Plasroid for Expression and 
Secretion of Human Interferon 

Although, as discussed above, our DKA sequence data suggest 
that the a-factor is synthesized as precursor proteins of 165 and 
120 anujK) acids, to such proteins have been described. The 
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processing and secretion mechanism of a-factor is not known. Recent 
studies, however, with altered a-factor indicate that the last step 
in the production of mature a-factor is apparently the removal of 
the glu-ala or asp-ala units before the release of the a-factor 
oligopeptides of 12-13 amino acids having the basic sequence 
HgN-drp )-Hi s-Trp-Leu-Gl n-Leu-Pro-Gly-Gl n-Pro-Met (or 
MetS0)-Tyr-COOH. 

The preparation of a plasmid to demonstrate the usefulness 
of the o-factor promoter and the a-factor presequences for 
expression and secretion of heterologous gene products is outlined 
in Fig, 5. The DNA sequences coding for the a-factor presequences 
for expression and secretion of heterologous gene products is 
outlined in Fig. 5. The DNA sequences coding for the a-factor 
peptides were removed from one of the a-factor clones {p53)such 
that the resulting plasmid, p57, contained the promoter sequences 
and the sequence corresponding to 89 amino acids of the a-factor 
"prepro" protein. This sequence was then joined with human 
interferon D (IFN-aj) gene to form plasmid p58. The human 
interferon D gene (58) was modified such that DNA sequences 
corresponding to Leu-Glu-Phe had been added before the initiating 
methionine codon. After modified interferon D gene had been joined 
with the a-factor "prepro" and the promoter sequences, these 
sequences were isolated and inserted into a yeast-^. coli shuttle 
plasmid YEp9T (Fig. 6). The plasmid YEp9T had been previously made 
by replacing the EcoR I -Sall fragment in plasmid YEplPT (59) with the 
EcoRI-Sall fragment from pBR322. This plasmid contains the pBR322 
(33) DNA needed for its selection and replication in E^. col i . In 
addition, it contains the yeast TRPl gene on an EcoRI to PstI 
fragment from chromosome lY (34-35) and a yeast origin of 
replication on a PstI to EcoR I fragment from the endogenous 2m 
plasmid DNA. These two DNA fragments from yeast allow for its 
selection in yeast and for its autonomous replication and 
maintenance as a plasmid. The resulting plasmid, p50, with the 
indicated orientation oi" the insert was selected because the 2u 
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origin contains a transcription termination/polyadenylation signal 
(37). The DNA sequence at the junction of the a-factor "prepro" 
sequence and the modified LeIFN-D gene present in p60 is shown in 
Fig. 7. The p60 plastnid was introduced into the yeast strain 20B-12 
and the trp^ transfonnants were grown and assayed for interferon 
production. 



K. Interferon Assay of Grovfth Mediign and Cell Extracts 

Individual colonies of the transfonnants were grown at 30**C 
in 20 ml YNB+CAA to an AggQ of approximately 10. For assay 10 ml 
aliquot was centrifuged at 7K rpm for 10 minutes in a Sorval SK24 
rotor. Various dilutions of supernate (media) were assayed. The 
cells were resuspended in 0.5 ml 7W guanidine-HCl containing an 
equal volume of glass beads and vortexed for 2 minutes at high 
speed. Both the cell lysate and the medium were then diluted into 
PBS/BSA (150 ntfi NaCl, 20 nrt sodium phosphate (pH = 7.9), and 0.5 
percent bovine serum albumin) for bioassay. Extracts of yeast were 
assayed for interferon by comparison with interferon standards by 
the cytopathic effect (CPE) inhibition assay (26). Up to one 
hundred million units of interferon per liter of growth medium was 
found. The cell extracts also yielded interferon at the rate of 
100 X 10^ units per liter of culture. 

L. Purification of Interferon from the Medium 

A single colony of yeast strain p60/20B-12 v/as grown at SO^C 
in 500 ml YNB+CAA to an AggQ of 2.4. Five hundred ml of this 
culture was diluted to 5L with YWB+CAA to give an AggQ of 0.21; 
the resultant 5L culture was grown at 30*C until AggQ = 70. At 
this time the 5L culture was harvested by centrifugation at 7,000 
rpm for 10 minutes. Ten ml aliquots were withdrawn periodically 
during the fermentation to measure optical density, interferon 
production and secretion. Before assay, each aliquot was 
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centrifuged for 5 minutes in a bench-top refrigerated centrifuge to 
separate the cells from the medium. The medium and cells were 
assayed as described above (see Fig. 8). Two different 

fermentations v/ere done. .The peak activity of interferon in the 
9 ^ 

media were 3x10 and 2x10" units per liter, respectively. The 
interferon activity in the cell extracts were 1x10^ and 2x10^ 
units per liter of culture. 

One and a half liters of frozen medium v/ere concentrated and 
dialyzed against 25mM Tris, lOnW EDTA, pH 8.0 in a 2.5 liter Amicon 
stirred cell (Amicon 2000) using a YH-5 ultrafiltration membrane to 
a final volume of 116 ml. A sample of the retentate was sequenced 
directly. Another sample of the retentate was acetone precipitated 
and sequenced. 

One ml of the concentrated medium was precipitated with 4 ml 
acetone, spun in a mi crofuge and washed with acetone. The pellet 
was resuspended in 0.1 percent TFA and further purified by HPLC on a 
Synchropak RP-P column. The column was eluted with a linear 
gradient of 0 to 100 percent acetonitrile in 0.1 percent TFA in 60 
minutes. A 12 pg sample of purified IFN-oAD was chromatographed as 
a control. The peaks of absorbance at 280 nm were sequenced. 

M. N-terminal Amino Acid Sequence of Interferon ►from Growth Medium 

Sequence analysis was based on the Edman degradation (27). 
Liquid samples were introduced into the cap of a modified Beckman 
890B spinning cap sequencer. Polybrene™ was used as a carrier in 
the cap (28). Reagents used v/ere Beckraan's sequence grade 0.1 molar 
Quadrol buffer, phenyl -isothiocynate, and heptafluorabutyric acid. 
Norleucine was added during each cycle with the Quadrol buffer to 
serve as an internal standard. The presence of PTH-norleucine in 
each chromatogram aided in the identification of PTH amino acids by 
retention time. The amino acid sequence analysis showed only one 
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species of interferon molecule with the NHg-tenninal sequence 
HH2-Glu-Ala-Glu-Ala-Leu-Glu-Phe-Met. The Met results frora the 
start codon at the N terminus of the interferon gene, and thus the 
protein produced contains 7 extra amino acids, three from the 
construction, i.e., Leu-Glu-Phe (see figure 7) and 4 frcwn the 
presequence of a-factor, i.e., Glu-Ala-Glu-Ala. Tte polypeptide 
containing this 7 amino acid H-teroinal extension retains interferon 
activity* 

N. Expression and Secretion of Other Heterologous Gene Products 

In the process of testing thre utility of the a-factor 
promoter and "prepro'-sequence, restriction endonuclease sites were 
created at the end of the a-factor "prepro" -sequence (see Fig. 7) 
such that the prtwaoter and the "prepro" -sequence could be isolated 
as a portable restriction fragment. An appropriate plasmid could 
then' be constructed to test the efficacy of this expression and 
secretion system for any heterologous gene containing suitable 
"sticky" ends* For this purpose an expression plasmid p55, was 
constructed as shown in Fig. 9. This plasmid, like YEp9T, contains 
the origins of replication for E. coli and yeast as well as 
selective markers for selection in each; of these two organisms. It 
also contains a convenient EcoR l site for gene insertion so that any 
gene that is contained on an EcoR I fragment wher& the first codon of 
the gene is immediately preceded ty the EcoRI site could be tested 
for the synthesis and secretion of the corresponding protein. 

The plasmid p65 was partially digested with EcoR I the linear 
molecules isolated, and ligated with EcoR I fragments containing 
various genes. After transformation of £. col i , plasmids that 
contained the inserts in the appropriate orientation were selected. 
For expression the frags^nt must be inserted at the EcoR I site 
following the promoter with the 5' end of the gene connected to that 
site. This orientation creates the junction between the a-factor 
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signal sequence and the heterologous gene as previously shown for 
LeIF-D (see Fig. 7). 

Table I lists genes that have been thus tested with the 
a- factor prompter and signal sequence. 

TABLE I Expression and Secretion of Other Heterologous Genes 
Using the a-Factor Promoter and Signal Sequence 

Growth Products per Liter 



Gene 


Vessel 


Cellular 


Medium 


Human Interferon t 


•Shake-flask 


i05 units 


trace 


Human Serum Albumin 


Fermentor 


25 mg 


3 mg 


Bovine Interferon al 


Fennentor 


100x105 U 


200x106 u 


Bovine Interferon a2 


Fermentor 


400xlo6 U 


60xl06 U 


Tissue Plasminogen Activator Shake-Flask 


20Mg 


20yg 


Rennin 


Shake- Flask 


lOOpg 


trace 


Human Insulin- Like 








Growth Factor 


Fennentor 


1-5 mg 


3 mg 



The expression of the first 4 genes was achieved by the 
insertion of EcoRI fragments into p65 as described above. The genes 
were obtained by EcoRl digestion of plasmids containing them as 
described in US 438,128 (GB 2116566, EP 88622) (BoIEU) ; IB 297,380 
(GB 2105343, EP 73646) (HSA) ; and US 312,489 (GB 2107718, EP 77670) 
(yIFN) and elsevghere, e.g, in Interfercris edited by Merigan, et 

al. , Academic Press, Inc. (1982), Proceedings of the Symposium on 
"Chemistry and Biology of Interferons: Relationship to 
Therapeutics", held March 8-12, 1982, Squaw Valley, California; 
Lawn, et al. Nucleic Acids Research 9. 6103 (1981); Gray, et al^. , 
Nature 295 , 503 (1982). 

Because of the placement of the restriction sites in the 
t-PA and rennin genes it was not practical to construct expression 
plasmids directly as above, but a modified approach was taken. The 
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construction of the t-PA expression plasmid is illustrated in Fig. 
10 using plasmid pt-PAtrpl2 (60) to obtain the t-PA gene by excision 
with Xbal and Bglll. p8I (described in US 438,236 (EP 88632), supra ) 
and YEplS (43). The rennin expression plasmid was assembled in 

ganalbgous manner using the rennin gene obtained by Xbal - Bcl I 
excision of pRI (described in US 452,227, EP applicaticxi 83307841.3). 
These two plasraids contain the LEU2 gene for selection in yeast. 
Therefore an a leu2 yeast strain was used for transformation with 
these plasmids. The prorennin expression plasmid pRl was 

^^constructed by incubating EcoRl-Pst I cleaved r-Itmn plasmid with 5' 
and 3' segments in the presence of T4 ligase. y-IMM ts a pBR 322 
derived plasmid described in U.S. Appln. Serial Mo. 312,489, 
(GB 2107718, EP 77670) and Gray, et al., Siyra . 

The 5" end fragment was a ligation of a synthetic fragment 

15 

and a fragment derived froo clone PFLA, as described below. The 
synthetic fragment has the structure: 

[Met] [Ala] [Glu] [He] [Thr] 
d A A T T CAfGtiCAtiAAATA TZT A S 

GTACCGTCTTTATTGTTCCTAGd 
reading direction — > 

This sequence contains an EcoRI site at the "upstream end" an 
ATG start codon, followed by sequences coding for the first four 
amino acids in prorennin, and tenni nates in a BaraHl site. 

25 

The cloned fragment to which this synthetic fragment is ligated 
in order to fonn the 5* end of the gene comprises a Xma-Baniil 
fragment of approximately 440 base pairs derived from the 5' end of 
the gene. 

30 

The appropriate 440 bp fragment was obtained from cDNA derived 

from unfractionated mRNA, using as primer, dGATCCGTCGAATTCGG, i.e., 

the "primer probe". The cDHA foraed using this primer was size 

fractionated as set forth above and fragments having more than 1,000 
35 
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base pairs inserted into the PstI site of pBR322 for cloning. The 
resulting clones were selected using both Tth probe and the 
primer/probe as probes. Only colonies hybridizing with Both were 
selected. From 1,230 transfonned colonies, about 300 colonies were 
50btained which showed hybridization with both probes. These were 
examined for presence of the 5' portion of the prorennin sequence as 
f ol 1 ows : 

The results of a series of double digestions using Ava I-Pvu I, 
^QAval-BamHl, BgllBamHI, and Bgll-EcoRI were analyzed. Advantage was 
thus taken of the .known Pvu I and 3gl I sites, each 125 base pairs 
either side of the p3R322 Pst site utilized for insertion of the 
cDNA sequence- These digestions provide suitable fragments for 
analysis- 

15 

The desired clone, PFLA, was selected by analysis of acryl amide 
gel electrophoresis performed on the above double digests of mini 
preps prepared from the identified clones, Plasinids were then 
isolated From PFLA clone, double digested with BamH I and Xmal, and 
2othe 440 bp fragment recovered by gel electrophoresis. 

The "complete" 5' end v/as then created by a standard ligation 
reaction utilizing the synthetic fragment and the PFLA clona 
BamHI-Xmal fragment with T4 ligase followed by cleavage with Xmal 
25dnd EcoRI. The resulting ligated sequences were purified on 
acrylamide gel electrophoresis selecting for the appropriate 455 
base pair fragment. 

The 3' end fragment was prepared in an mariner analogous to that 
2QUsed to prepare the PFLA clone. cDNA containing >1000 bp formed 
from unfractionated messenger RNA using oligo-dT as primer, was 
cloned as above, and colonies selected with Tth probe. Approximately 
50 colonies resulted. The desired clone was selected by analyzing 
the results of gel electrophoresis formed on plasmid minipreps which 
35vere double digested with BaraHI/3glI, PvuI/BamHI, EcoRl/Bgll, and 
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PvuI/EcoR I again taking advantage of the Pvul and Bgll sites 
flanking the PstI insert site. The plasmids fro(B the desired colony 
were isolated then cleaved with Xraa I and Pst I and electophoresed 
to isolate the 800 bp sequence of the "3 - 375" fragment. The 
5 "3-375" fragment extends from the Xaa site congruent with that froa 
the PFLA fragoent, past the end of the gene to the Pst 1 insertion 
site. 

The transfonaiants carrying various expression plasraids were 
TOgrown in appropriate sedia. The cultures were fractionated into 
supernatants and cells. The supernatants (media) and'cell extracts 
were assayed for the expression and secretion of various gene 
products. The bovine interferon activity was assayed by comparison 
with interferon standards by cytopathic effects. The amounts of 
^gother prodiKTts in the eediua and cell extracts were detenained by 
radioimmunoassays. The values listed in the table are the peak 
activities. We have detenained that both cellular and secreted t-PA 
molecules possess biological activity. 

20 

0, Characterization of Secreted Bovine Interferon 

The bovine interferon-o^ secreted into culture mediiai has been 
purified and characterized as described below. 

25 

7 liters of culture ©ediiaa was adjusted to pH 8.02 with soditan 
hydrojcidte. This solution was then loaded onto a 2.5 x 18 cm l^igel 

ACA column pre-equilibrated with SftjiM tris, pH 8.0. After loading, 
the coluian was washed with SOort Tris, 1 percent (w/v) PEG 8000, pH 

3o8.0 until A^^q was approximately zero. The column was then eluted 
with lOOmM aianonium acetate, 2 percent (w/v) PEG 8000, pH 5.0 
followed by 20mH glycine, 2.5 percent (w/v) PEG 8000, pH 2.0. The 
majority of the interferon activity eluted in a single peak during 
the pH 5.0 elution. The pooled material from the Nugel column (83 

35iTiL, 150H units) was loaded onto a 2.5 x 5.0 cm SE-53 column 
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pre-equilibrated with 25mM ammonium acetate, pH 5.0. The interferon 
activity eluted in a single peak during the sodium chloride 
elution. This pool contained l^mg protein, 106M units in 11 mL. 
The purity of this material as judged by SDS PAGE was approximately 
80-90 percent. 

Ten ml of the SE-53 pool was applied to a 2.5 x 18 cm Sephacryl 
S-300 column equilibrated in 25mH sodium phosphate, pH 6.0. The 
column was eluted with this buffer and the interferon activity 
eluted in a single peak. This final pool contains 8 mg protein, 113 
H units in 17 mL. 

N-terminal sequence analysis of the protein present in the SE-53 
pool indicates that the bovine interferon secreted by yeast has been 
processed at three distinct sites. The three sites of processing 
and the relative amounts of each are as follows: 

-8 -6 -3 +1 

. - .ly s-arg-gl u-al a-gl u-al a-1 eu-gl u-phe-met-cy s-hi s-1 eu-pro-hi s. . . 

63 pet. 13 pet. 24 pet. / 

As shown above, met iirenediately precedes the N-tenninus of the 
bovine interferon; the short peptide extensions do not result in 
loss of interferon activity. 

Production and Secretion of Mature Heterologous Proteins 

In both cases (human IFN-a^, and bovine IFN-a^ ) where the 
N-terminal amino acid sequence of the secreted polypeptides was 
determined, the proteins contained extensions of 2 to 7 amino acids 
in addition to the initiating methionine. Although these 
polypeptides have biological activity, it would be preferable to 
produce and secrete into the growth medium proteins that are 
identical to the proteins from the natural sources. 
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In order to produce the interferon molecule that contains 
cysteine as the K-terainal araino acid (as is the case with the 
natural IFN-qj^) we needed to Eoodify the junction between the 
factor "pre-pro" sequence and the IFN-a^ gene such that the 
removal of the modified "pre-pro" sequence will result in release of 
a mature interferon molecule containing the natural N-tenainus. An 
outline of the procedure to obtain such a junction is shown In 
Figure. 11. A DHA fragment containing the MFal promoter and 
"pre-pro" sequence and the modified IFN-a^ gene was isolated and 
cloned into tte EcoRI site of Ml 3 rap8 (61). Single-stranded DMA 
template was prepared from the recombinant phag"e containing the 
insert in the appropriate orientation. This template was annealed 
with a phosphorylated oligonucleotide. The synthetic 
oligonucleotide is 24 bases long and is complanentary to 12 bases 
coding for leu-asp-lys-arg near the C-terminus of the "pre-pro" 
sequence and to 12 bases coding for cys-asp-leu-pro, the first 4 
amino acids of natural IFH-a^. This prioer-templated intenaediate 
was subjected to extension and ligation reaction at 23*C for 2 hours 
in the presence of 500 nMdATP, 100 nMdTTP, 100 mMdGTP, 100 niMdCTP, 
20 3«dATP, 3 units DHA polymerase (Klenow), and 400 units t4 D«A 
ligase in 10 m Tris pH7.4, 50 ra« MaCl and 10 n« Mg 504. Then 
additional 3 units of DHA polymerase (Klenow) and 400 units of T4 
BflA. ligase was added and mixture incubated for 2 hours at 23*C 
followed by incubation at 14'C for 15 hours. This mixture was used 
to transform E. coli JMlOl (62). The phage plaques were screened . 
for hybridization with the^^P-labeled oligonucleotide. Templated 
DNA from 2 positive reccntoinant phages was prepared and secjaenced 
using a primer complefaenrary to IFn-a^ DNA. Doable stranded DHA 
from one recombinant phage that contained the desired deletion 
(deletion of 24 nucleotides shown as a loop in Fig. 11) was 
prepared. The EcoR I fragment containing the modified junction 
between the "pro-pro" sequence and the IFM-a-j gene was isolated 
and ligated to EcoR I cleaved YEP9T. After transformation of E. coT 
with the mixture a plasraid (p75) with correct orientation of the 
insert was chosen for further study. 
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p76 DNA was prepared from E. coli and used to transform 20B-12 
strain (ATCC 20625) of yeast. Ten liter culture of one transforniant 
was grown and the culture medium was centrifuged to separate the 
medium from the cells. 

5 

500 ml of yeast medium v/as dialyzed into 25 nM Tris, pHS.O, 10 
wSA EDTA. The dialyzed media was then run through an immuno affinity 
column containing monocloried antibody to nature (LeIFA). After 
washing with 24 mM TRIS pH8.0, 10 mf^ EDTA, the intrferon activity 
^Qwas eluted with 0.2M acetic acid. The majority of interferon 
activity was found in Fraction Ho- 45. 200 m1 of this fraction was 
subjected to N-terminal amino acid sequence analysis as described 
before. The major sequence found was that of natural interferon D. 
The first 8 M-tenninal amino acids of the protein were: 
cy s-asp-1 eu-pro-gl u-thr-hi s-ser . 

15 

Additional Explanatory Motes re Figures 

Figure 2 
20 

Localization of homology between the a-factor probes and the DNA 

fragments from p51 (a) and p52 (b) recombinant plasmids. The two 

plasmids were digested with different restriction endonucleases and 

then electrophoresed on a one percent agarose gel. The DNA 

^(-fragments were transferred to nitrocellulose paper and hybridized to 
32 

P-labeled probes. Panel A: Ethidium bromide stained gel. 
Panel B: Southern blot. Lanes: 1, EcoRI; 2, Sail; 3, Hindlll; 4, 
SamHI; 5, Psti, The arrows indicate the two DNA fragments that were 
subcloned. The size standards were derived from lambda, YRp7, or 
^^BR322 Df4A. 

Figures 3 



Nucleotide sequence of MFa. gene and its nontranslated 5* and 
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3' flanking regions. The predicted amino acids sequence of the 
pheromone precursor is also shown. The nwnbers above and below the 
sequence denote the positions of araino acids and nucleotides, 
respectively. The four copies of the a-factor sequences are 
included in the boxed areas. The asterisks indicate differences in 
the nucleotide sequence in one or sore copies of the a-factor coding 
regions. Three potential M-glycosylation recognition sites are 
indicated by bars. This gene corresponds to the gene reported -by 
Kurjan, et al_. (44). 

Figure 4 

Nucleotide sequence of WaZ gene and Its nontranslated 5* and 3' 
flanking regions. The underlined araino acids indicate differences 
between the two pheromone copies encoded ty the MFa2 gene. See 
Fig. 3 for other details. 

Figure 5 

Joining of the IFN-a^ gene with the a-factor proraoter and the 
a-factor presequence. Since, as stown in Fig. 3, the 1.8 kbp EcoRI 
fragment contained the prraoter, the entire DNA sequence 5" to the 
sequences coding for the mature a-factor was joined to the modified 
IFH-aj^ gene such that tte a-factor presequence and the IFN-aj^ 
protein would be synthesized as a single precursor protein using the 
a-factor promoter. 

Figure 7 

The protein and DKA sequence at the junction of a-factor 
"prepro" and the oodified IFM-a^^ genes. The Xbal and EcoR I sites 
at the junction are indicated. 
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Figure 8 

Cellular and medium IFN-a^ levels during fennentation. At 
various time intervals 10 ml culture was removed from the fennentor. 
centrifuged to separate the cells and medium. Cell extracts were 
prepared as described above. Interferon levels in the mediuin and 
the extracts were detennined. 



10 Figure 9 

Construction of a yeast/E. coli shuttle vector plasmid for the 
expression of heterologous genes using the a-f actor promoter and 
signal sequences. The 1.12 kbp EcoR I fragment containing the 
15 promoter and signal sequence was isolated from p58 and was inserted 
into the EcoR I site of Y£p9T. 



• Figure 10 

20 

Assembly of plasmid p68 for expression and secretion of tissue 
plasminogen activator. 



25 Figure 11 



30 



Scheme for in vitro deletion mutagenesis. The 24 nucleotides 
that v/ere deleted at the junction of MF21 "pre-pro" sequence and the 
modified IFM-al gene are shown as a loop in the figure. 



Figure 12 



35 



Comparison of amino acid sequences of the putative a-factor 
precursors encoded by flFal and MFa2 genes. Various gaps were 
created to align the sequences with maximum horology. 
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Claims: 



1. A process for obtaining a protein heterologous to yeast 
as a product of yeast expression, which comprises: 

transfonning a yeast organism with a yeast expression 
vehicle comprising the DMA sequence of the promoter for yeast alpha 
factor operably connected to a DNA sequence encoding a protein . 
heterologous to the yeast organism; 

culturing the transfonned organism; and' 
recovering the protein from the culture. 

2. A process for obtaining a protein heterologous to yeast 
as a product of yeast expression, which process comprises: 

transforming a yeast organism with an expression vehicle 
comprising the DNA sequence encoding substantially the pre-pro 
peptide of yeast alpha factor operably connected in translation 
reading frame to a DNA sequence encoding a mature protein 
heterologous to the yeast organism; 

culturing the transfonned organism; and 
recovering the protein from the culture. 



3. A process for obtaining a protein heterologous to yeast 
as a product of yeast expression, processing and secretion, which 
process comprises: 

transfonning a yeast organism with an expression vehicle 
comprising the DNA sequence of the promoter operably linked to 
substantially the pre-pro peptide sequence of the yeast alpha factor 
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gene which is operably connected in translation reading frame to a. 
DMA sequence encoding a mature protein heterologous to the yeast 
organism; 

culturing the transformed yeast organism; and 
recovering protein from its supporting medium- 

4. A process for secreting a protein heterologous to yeast 
into the supporting medium, which process comprises: 

transforaing a yeast organism with an expression vehicle 
comprising the DMA sequence encoding substantially the pre-pro 
peptide of yeast alpha factor, operably connected in translation 
reading frame to a DNA sequence encoding a mature protein 
heterologous to the yeast organism; and 

culturing the transfonned organism. 

5. A process of Claim 4 wherein said DNA sequences are 
under the control of alpha factor promoter. 

6. A yeast expression vehicle comprising the DNA sequence 
of the promoter of the yeast alpha factor gene operably connected to 
a DNA sequence encoding a protein heterologous to the yeast organism. 

7. Pn expression vehicle of Claim 6 which also includes 
the DHA segaence encoding substantially the pr&-pro peptide of yeast 

alpha factor operably linked in translation reading frame upstream 
to the DNA sequence encoding a mature protein heterologous to the 
yeast organism. 



0384L 



-35- 



01 23544 



8. A yeast expression vehicle comprising the DNA sequence 
encoding substantially the pre-pro peptide of yeast alpha factor 
gene operably connected in translation reading frame to a DNA 
sequence encoding a mature protein heterologous to the yeast 
organism. 

9. ^ expression vehicle of Claims 6 to 8 wherein the 
DNA encoding heterologous protein s.a.g. encodes for a protein 
selected from the group consisting of human interferon, bovine- 
interferon, tissue plasminogen activator, and rennin. 

10. An expression vehicle of Claims 6 to 8 wherein 
the DNA encoding heterologous protein encodes for insulin-like 
growth factor. 

11* A yeast organism transformed with an expression 
vehicle of any one of Claims 6 to 10. 

12. A yeast organism c^>able of producing mature 
heterologous protein in the supportive medium, as a product of 
expression with a N-terminus pre-seguence derived from yeast 
alpha factor DNA, processing of said pre-seguence and secretion 
of the mature protein into the iredium. 

13. The organism of Claim 12 v^erein the mature 
heterologous protein is human insulin-like growth factor. 
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Carboxy terminus 
of ff-factor; 



Possible codons 
and their usage 



Gly 



Fig.1 . 



Gin — Pro 



Met — Tyr COOH 



GGU (90) CAA (20) CCA (32) AUG (20) UAC (33) 

GGC (3) CAG (0) ecu (3) UAU (0) 

GGA (0) CCC (1) 

GGG (0) CCG (0) 



Consensus 
oligonucleotides : 



> ' -GG JcAACC^TGTAC 



Synthesized 
oligonucleotide 
pools complemen- 
tary to above: 



1 . 5 ' -GTACATTGGTTGtcC 

G 

II . 5 ' -GTACATAGGTTG^C 



Fig. 7 



ALPHA-FACTOR "PRE-PRQ- SEQUENCE 

. • .LYS ARG GLU ALA GLU ALA 

• • .AAA AG A GAG GCT GAA GCT 

• ..TTT TCT CTC CGA CTT CGA 



MODIFIED IFN-ALPHAi GENE 
LEU GLU PRE MET... 
CTA GAA TTC ATG. . . 



1 




Xbal 






EcoRI 
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CGACAGTAAA7pTGCC{yW\mAATAGCTTCTACTGAAAAACAGTGGACCATGTGAAAAGAT^ 

-280 -260 -240 .220 

ACACATAATAnCMGT6AGCCnACTTCAAnGTAnGAAGT6CAAGAAAAC(^^ 

-200 -180 -IFO -170 

AGTACATATATAAGAGGGCamGnCCCATCAAAAATGnACTGnCTTACGA^ 

"120 -ffO -^0 



CAAACAAGAAGAnACmCTATCAAmmACACAATATW^ {J?G A6A TTT CCT TCA ATT 

-40 -20 1 

20 mm^m^^m 

Phe Thr Ala Val Leu Phe Ala Ala Ser Ser Ala Leu Ala Ala Pro Val Asn Thr Thr Thr 
TTT ACT 6CA GH TTA TTC 6CA 6CA TCC TCC GCA TTA Ga GCT CCA G?C S 35 2X 2a 

40 60 

40 

Glu Asp Glu Thr Ala Gin He Pro Ala Glu Ala Val He Gly Tyr Leu Aso Leu Glu Civ 
GAA gat GAA ACG GCA CAA ATT CCG GCT 6AA 6a 6TC ATC 6GT {a^ m ^T TO 6M GK 
^ 100 120 

Asp Phe Asp Val Ala Val Leu Pro Phe Ser A^T^jnT? Asn Asn Gly Leu Leu Phe Hp 
GAT TTC GAT GTT GCT 6TT TTG CCA TH TCC AAC AGC ACA AAT AAC 6GG TO TT6 TTT AtI 
140 160 180 

■■HMH^M 80 

Asn Thr Thr He Ala Ser He Ala Ala Lys Glu Glu Gly Val Ser Leu Asd Lvs Am gi.. 

*So '^^^ '^^ *^ S ^'^ ^'j^F *^ 



Ala Glu Ala 
GCT GAA GCT 

260 



100 

Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 
TGG CAT TG6 TT6 CAA CTA AAA CCT 6GC CAA CCA ATG TAC 
***** 



280 



300 



Lys Arg Glu Ala 
AAG AGA GAA GCC 



Glu Ala Glu Ala 
GAA GCT GAA GCT 

320 



120 

Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 
TGG CAT TGG CT6 CAA CTA AAG CCT 6GC CAA CCA ATG TAC 



340 



360 



Lys Arg Glu 
AAA AGA GAA 



Ala Asp Ala Glu Ala 
GCC GAC GCT GAA GCT 

380 



140 

Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 
TGG CAT TGG CTG CAA CTA AAG CCT GGC CAA CCA ATG TAC 



400 



Lys Arg 
AAA AGA 



420 



Glu Ala Asp Ala Glu Ala 
GAA GCC GAC GCT GAA Ga 

440 



Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tvr 
TGG CAT TGG TTG CAG TO AAA CCC GGC CAA CCA ATG TAC 



460 



End 
TAA 



480 



rcCCGAaGATAACAACAGTaAGATGTAACAAAGTCGACm^ 



5l0 



STO 



5^0 



ATOTTTTCATnCTCCGTAAACMCATGTTTTCCCATGTAATATCCnnCTATTTTTGG^^ 



600 



6?0 



6T0 



ACATACTTTATATAGCTATTCACnCTATACACTAAAAAACTAAGACAATTTTAATTTTGCTGCCTGCCATAnTD^ 
660 680 71^0 7?o 

TTGTTATAAATTCCTATAATTTATCmTTAGTAGCTAAAAAAAGA^ 



7^0 



7^ 



8^0 
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TTCnCAmGTACATCAATGCCAGCAACGATGTGCGCATCTGGGCGACGCCTGTAGTGATTGTTTTCAAGGTATCGAG 
"3irO -2l0 -2^0 -270 

CCAAACTATTCATCGTTACTGTnCAAATAnCAGTTGTTTCAGTACAGAGTCGCCGTGGACCTAGTGAAACnGGTGT 
-2?0 -^0 -iSO .1^0 

CnTACAGCGCAGAGACGAGGGCnATATGTATAAAAGCTGTCCTTGATTCTGGTGTAGTnGAGGTGTCCnCCTATA 
-170 -170 -ffO 

TCTGTTmATATTCTATATAATGGATAAnACTACCATCACCTGCATCAAATTCCAGTAAAn 
-60 -40 -20 



20 



1 10 

Met Lys Phe He Ser Thr Phe Leu Thr Phe He Leu Ala Ala Val Ser Val Thr Ala Ser 

AT6 AAA TTC ATT TCT ACC ITT CTC ACT TTT AH TTA GC6 GCC GTT TCT GTC ACT GCT AGT 
1 20 40 60 

30 40 
Ser Asp Glu Asp lie Ala Gin Val Pro Ala Glu Ala He He Gly Tyr Leu Asp Phe Gly 
TCC GAT 6AA GAT ATC GCT CAG 6TG CCA GCC GAG GCC ATT AH GGA TAC TTG GAT TTC GGA 

80 100 120 

50 mam^^mm 60 

Gly Asp His Asp He Ala Phe Leu Pro Phe Ser Asn Ala Thr Ala Ser Gly Leu Leu Phe 
GGT GAT CAT GAC ATA GCT TTT TTA CCA TTC AGT AAC GCT ACC GCC AGT G6G CTA TTG TTT 

140 160 180 

^^■i" 70 MMHMI 80 

He Asn Thr Thr He Ala Glu Ala Ala Glu Lys Glu Gin Asn Thr Thr Leu Ala Lys Arg 
ATC AAC ACC ACT AH GCT GAG GC6 GCT GAA AAA GAG CAA AAC ACC ACT TTG GCG AAA AGA 

200 220 240 



Glu Ala Val Ala Asp Ala 
GAG GCT Gn GCC GAC GCT 



90 

Trp His Trp Leu Asn Leu Arg Pro Gly Gin Pro Met Tyr 
TG6 CAC TGG HA AAT TTG AGA CCA GGC CAA CCA AT6 TAC 
* * * 



260 



280 



100 

Lys 

AAG 



300 



Arg Glu Ala Asn Ala Asp Ala 
AGA GAG GCC AAC GCT GAT GCT 



110 



120 



Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 
TGG CAC TGG HG IM CTC TM CCA GGC CAA CCA ATG TAC 



320 



340 



360 



End 

TGA AAAATGACCCTAAACTACnCTAAACCCTCTCGAnTCTnCACGTTCATACAACACCTAGrrTTATTTATTTTC 
3®) 400 470 



TTTTCAATCTGAGTAGTTGAGnnCGATCACTCACATAGAACTATnnTGCCAHTAAATAAAGTATTCTCTCAAAT 
470 4^0 4^0 STO 

GATGCGATACTATAATACTCTHGCCATATAnACATTCAnCATAAATAGGCTATGinCTATATCCGmCCGATTC 
5?0 570 5^^0 5^0 

TGTCTGCAAGCAAGGTTCCCTATCATTACCGGAnGTTCACTATGGnCGAGCTC 
elJO 670 670 
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EcoR\ 



p58 



Fig.9. 



EcoR\ 

Isolate 1240 bp 
fragment 

EcoRl 



a-foctor promoter and 
signot sequence 



EcoRi 



EcoRl 




EcoR\ digested YEpT9 



T 



T4 DNA Ligase 



Yeast 2// origin cv^pi 
of replication 



Yeast TRP I 
gene 




o(-foctor promoter and 
signoi sequence 



EcoR\ 




S/li 
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lo/ii 




BamWX 



Partiol £coRI 
digestion , 

Isolate ISOObp ' 
fragment 



;CTCt 



A 
G 
T 



A 
A 
T 



— TCTTTGGATAAAAGA TGTGATCTCCCTGAG — 
3* H0-AACCTATTTTCTACACTAGAG6GA-P 5' 




DNA polymerase (Klenow) 
T4 DNA ligase 
dNTPs, ATP 



MFa\ promoter 
ond 

"pre- pro" sequence 



modified 
gene 



I230bp 



EcoYKX 



570bp 



I 



□gate to £cc?Ri-cut Mi3mp8 ds DNA 
Transfornr> E coli 

Determine orientation of insert in 
recombinant phage (white plaques) 

Prepare single-stranded template DNA 
from phage 

Anneal with synthetic oligonucleotide 



Fig.11. 



heteroduplex 
product 



EcQ//^ Screen ploques with ^^P- labeled 
mutagenic oligonucleotide 
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